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Abstract 

We consider some elementary aspects of the geometry of the space 
of probability measures endowed with Wasserstein distance. In such a 
setting, we discuss the various terms entering Perelman's shrinker en- 
tropy, and characterize two new monotonic functionals for the volume- 
normalized Ricci flow. One is obtained by a rescaling of the curvature 
term in the shrinker entropy. The second is associated with a gradient 
flow obtained by adding a curvature-drift to Perelman's backward heat 
equation. We show that the resulting Fokker-Planck PDE is the natu- 
ral diffusion flow for probability measures absolutely continuous with re- 
spect to the Ricci-evolved Riemannian measure, we discuss its exponential 
trend to equilibrium and its relation with the viscous Hamilton-Jacobi 
equation. 
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INTRODUCTION 



The Ricci flow introduced by R. Hamilton [19], (sec [2, 12, 21] for re- 
views), is a geometric evolution equation which deforms the metric y of a 
Riemannian manifold in the direction of its Ricci curvature Ric{g). 

Under suitable conditions it provides the natural technique for smoothing 
and uniformizing (S,^) to specific model geometries. Prom the perspective 
of theoretical physics, the Ricci flow often appears as a real-space renormal- 
ization group flow describing the dynamics of geometrical couplings. Typical 
examples are afforded by non-linear d-model theory [17] or by the averaging 
of cosmological spacetimes [10, 11]. In such a setting, in order to estimate 
the net effect of renormalization on the scaling of geometrical parameters, it 
is often desirable to establish monotonicity results for the various curvature 
functionals associated with the flow. This is an extremely non-trivial task 
and the recent results by G. Perelman [36] provide an important unexpected 
breakthrough of vast potential use in geometrical physics. Apparently in- 
spired by the dilatonic action in string theory, Perelman has introduced [36] 
the two functionals 



F[5;/]=^(i?(/?) + |V/|V^d/x,, 



and 

W[g;U,T] = [ \r(\Vfp\^ + R{P))+f„-3\ " 

L ^ / J (47rr(/3))2 

(see below for notation) depending on the geometry of the Riemannian man- 
ifold (S, g{P)) undergoing a Ricci flow evolution (3 g{(3), (3 € [0, T), and on 
the choice of a probability measure dzuiP) = (47rT(/?))-2 Vol[J:p]e-^f>dUp, 
{dUp = FoZ~^[S^]d/Xg(^) denoting the normalized Riemannian volume ele- 
ment), associated with a backward diffusion of the function /. The func- 
tional F[g; /] has, as already stressed, the structure of the dilatonic ac- 
tion, familiar in non-linear cr-model theory and in the statistical mechanics 
of extended objects. W[g; ffs,T] is basically a scale-invariant generaliza- 
tion of F[g; f] associated with the introduction of the scale parameter t(/3), 
which controls the localization properties of the measure dw{P). The ba- 
sic property of F[g; f] and W[g;fp,T] is their weakly monotonic character 
along Ricci flow trajectories, a fact that has been put to use by Perelman 
[36, 37, 38] in his work on the proof of Thurston geometrization con- 
jecture. Such monotonicity properties and a few formal similarities with 
standard entropies in statistical mechanics accounts for the attribute en- 
tropic. Further justifications come from a closer look into the structure 
of F[g; f] and W[g; ff3,T]. In this connection, the functional W[g;fi3,T] 
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is particularly interesting since, as is easily checked, it contains a nat- 
ural combination of the relative entropy S'[(iCT(/3)||(in^] associated with 
the pair of measures (dll^, dro (/?)), of the corresponding entropy generat- 
ing functional /[dro(/3)||dn^], and of the dtx7(/3)-localized curvature average 
< R{P) >(i-c}j{0)=Jj^ RiP)d'cu{P). The particular form of their combination 
in W[g; r] is strongly suggested by the theory of logarithmic Sobolev in- 
equalities [36, 18]. Note that < R{P) >dw{i3) enters as a defective parameter 
setting the size of scalar curvature over the region where dw{f3) is localized. 
It is interesting to remark that none of the constituents of W\g] fp^r] has, 
by itself, any manifest monotonicity property along Ricci flow trajectories, 
and it is just their overall interaction in W[g] //j, r] that makes the shrinker 
entropy monotonic. Various authors [16, 30, 31, 32] have exploited the 
strategy suggested by Perelman's construction and succeded in specializing 
or extending W[g; fp^r] to other specific settings. However, the natural 
question of the monotonicity of the constituent entropic functionals gener- 
ating W[g; //3,r] does not seem to have received particular attention. Such 
an analysis is relevant to the physical applications of the Ricci flow and 
also for a deeper understanding of the properties of Perelman's shrinker en- 
tropy. In this paper we discuss such an issue in connection with the volume- 
normalized Ricci flow (such a choice being motivated by our long-standing 
interest in cosmological applications of the theory). Our main results are 
twofold. The analysis of the volume-normalized version of the Hamilton- 
Perelman flow easily shows that by a natural renormalization r(/3) i— > r(/3) 
of the scale parameter t{0) one can make explicitly monotonic the curvature 
term t(/3) < R{I3) >dw((3} appearing in W[g; fp, r]. In this way, we can con- 
nect the growth properties of scalar curvature to the localization behavior 
of dTu{P). This strategy suggests also that, by deforming Perelman's back- 
ward diffusion {dw{P)}f3^T ^ {d^{P)} f3<T by adding a suitable drift term, 
one may get also monotonicity for the corresponding renormalized relative 
entropy S'[dil(/3)[|(in^]. Quite remarkably, the answer is in the affirmative 
and the resulting deformation is provided by a Fokker-Planck (backward) 
diffusion {dQ.t}t>o^ t = P* — P, with a drift term generated by the scalar 
curvature fluctuations. We show that, in a well-defined sense, this Fokker- 
Planck process is the natural diffusion along the (volume-normalized) Ricci 
flow. The analysis of {dQ,t}t>o shows that we are dealing with a gradient 
flow generated by a (weakly) monotonic relative entropy ^[(iilt jjiillt]. As an 
elementary consequence of such a monotonicity, we prove that if the Ricci 
curvature is positive then one gets exponential convergence of {dUt}t>o *° 
dUf 
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Our analysis relies on a remarkable parametrization of diffusion processes 
suggested by F. Otto [34, 35] related to the use of the Wasserstein met- 
ric on measure spaces [40], (see [1] for an in depth and very informative 
presentation of the whole subject). The distance induced by such a metric 
provides a way of turning the space of probability measures on a Riemann- 
ian manifold into a geodesic space, and has recently drawn attention in 
attempts of extending the notion of Ricci curvature to general metric spaces 
[26, 28, 40, 46]. The preliminary results presented in this paper point to 
the possibility, recently advocated also by J. Lott and C. Villani, that the 
use of the geometry of the space of probability measures and of the associ- 
ated notions of optimal transport and Wasserstein metric may also play a 
significant role in Ricci flow theory. ( Added in the arXive version v3: impor- 
tant developments relating Wasserstein distance and Ricci flow theory have 
been recently considered also by R. J. McCann and P. Topping [29], useful 
remarks in this connection are also discussed in the nice monography [42] 
by P. Topping. The enlightening book by C. Villani [45] provides a most in- 
spiring analysis of the deep interplay between optimal transportation theory 
and Riemannian geometry). 

Outline of the paper. We star by recalling a few basic properties of the 
space of probability measures on Riemannian manifolds. In section 1.1 we 
discuss the geometry of such a space from the point of view advocated by 
F. Otto. In particular we analyze the case when a curve of probability mea- 
sures covers a fiducial curve of Riemannian metrics. The natural framework 
for such a discussion is not a fixed probability space but rather a bundle 
of probability measure spaces over the space of Riemannian metrics, where 
each fiber is a probability space with a distinguished reference measure, 
(the normalized Riemannian volume element), metrized by the Wasserstein 
distance with cost function determined by the given Riemannian distance 
function. This is the situation occurring in the application of the formalism 
to Ricci flow theory. To our knowledge, such a general framework is not 
explicitly discussed in the existing literature on optimal transportation the- 
ory, and thus we pause a little bit for analyzing it in some detail. In such 
a setting we discuss explicitly the properties of gradient flows in the bundle 
space of probability measure. We conclude (section 1.2) this long overview 
of the probabilistic formalism by recalling the characterization of Wasser- 
stein distance and its interplay with the relative entropy and the entropy 
production functionals. The connection with the formalism developed by 
Otto comes by when discussing the characterization of Wasserstein length 
of a curve of probability measures and its geodesic nature. Here we ana- 
lyze in some depth the extension of the notion of Wasserstein length of a 
curve of probability measure to the case when the curve in question cov- 
ers a reference curves of Riemannian metrics. Again, this case does not 
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appear in the standard literature and needs to be explicitly addressed. In 
particular, we emphasize that the characterization of Wasserstein geodesies 
as solution of a Hamilton-Jacobi equation cannot be trivially extended to 
this more general case. In our opinion this is a basic issue to be solved in 
order to apply optimal transportation theory to the Ricci flow. We com- 
ment on a possible approach to a strategy for a solution in the final part of 
the paper. In section 2 we discuss the Perelman coupling for the volume- 
normalized Ricci flow. Section 2.1 recalls a few properties of the shrinker 
entropy, some really well-known, and a few others not so easily spotted 
in the existing literature. In particular, by elaborating on a remark by A. 
Figalli, we explicitly show that the shrinker entropy is an entropy balance 
functional basically generated by the time derivative of the relative entropy 
associated with Perelman backward heat equation. In section 2.2 we prove 
that there is a natural combination of a scale parameter and of the average 
curvature {R)d'U7 which is weakly-monotonic along the Ricci flow. This cur- 
vature entropy gives rise to a useful dro-averaged Harnack-type estimate. 
In section 3 we introduce the relation between Ricci flow and Fokker-Planck 
diffusion. This exploits Otto's parametrization of a probability measure by 
introducing a potential for scalar curvature fluctuations (section 3.1). Such 
a potential has a familiar counterpart in the Ricci flow theory for surfaces, 
and plays a distinguished role in our analysis. In particular we use it for 
estimating the Wasserstein length of the curve of normalized Riemannian 
volume elements along the Ricci flow. In section 3.2 wc exploit the formalism 
so developed for discussing, under very general conditions, the evolution of 
an absolutely continuous curve of probability measure along the (backward) 
Ricci flow. In section 3.3 these results are used to prove that Fokker-Planck 
diffusion is the natural diffusion of a probability measure along the Ricci 
flow. In particular, the associated relative entropy is weakly-monotonic 
and the flow is gradient-like. We also show that the flow of the associ- 
ated Radon-Nikodym derivatives (with respect to the evolving Riemannian 
measure) is a true gradient flow with respect to Otto's inner product. We 
also emphasize the associated contraction properties in the corresponding 
(quadratic) Wasserstein distance, and their role in discussing the trend to 
eqTiilibrium for the Fokker-Planck diffusion. In such a setting one naturally 
discovers that the associated relative entropy is displacement-convex. A 
property, this latter, which is strongly reminiscent of the characterization 
of Wasserstein geodesies in the space of probability measures. This point is 
discussed by showing that Fokker-Planck diffusion along the backward Ricci 
flow can be equivalently rewritten as a viscous Hamilton-Jacobi equation, 
where the viscosity parameter is related with the lower bound of the Ricci 
curvature. The paper concludes with an appendix stressing a few basic dif- 
ferences between Fokker-Planck diffusion and Perelman diffusion along the 
backward Ricci flow. 
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1 Probability measures on Riemannian manifolds 

Throughout this paper, S will denote a smooth three-dimensional manifold, 
which we assume to be closed and without boundary. We let C°°(S,R) 
and = C°°(i;, (8)+ T*!!) be the space of smooth functions and of smooth 
definite positive symmetric bilinear forms on S, respectively. is the 

group of smooth diffeomorphisms of E, and TZiemiT) = C°°(S, ®\ r*E) is 
the space of all smooth Riemannian metrics over S. The tangent space , 
T(Y, g-^7i,iem{T,), to TZiemiT,) at (E,^) can be naturally identified with the 
space of symmetric bilinear forms C°°(E, 0^ T*E) over E. It is endowed 
with the pre-Hilbertian L'^(S,g) inner product defined on {T,,g) by 

(1.1) {W, y)i2(s) = ^ / /™ Wik Vimdllg, 

VF, V G C°°(E, (g)^ r*E) being (square-summable) . The hypothesis of smooth- 
ness has been made for simplicity. Results similar to those described below, 
can be obtained for finite Holder or Sobolev differentiability. In such a frame- 
work. Let d^g, Vol [E]^, and dUg = Vol [E]~^ dfig respectively denote the 
Riemannian density, the volume, and the corresponding normalized mea- 
sure on a Riemannian manifold (E,p) G 7?.iem(E). In what follows, we 
will often refer (rather informally) to the bundle tt : Pro6(E) 7?.iem(E) 
of all Borel probability measure on E, which are absolutely continuous 
with respect to normalized Riemannian volume element dllg. Each fiber 
Proh{T,,g) = 7r~^(E,5) is endowed with the topology of weak convergence, 
and can be parametrized by the set of all Radon-Nikodym derivatives with 
respect to dlig, i.e., 

(1.2) Prob{^,g) = dUg : N € Cb(E,M+), J N dUg = l| , 

where Cb(E, M+) is the space of positive bounded measurable functions [15], 
(again, we often restrict our analysis to the smooth functions in Cf,(E,M+)). 
To avoid notational prolixity, given a probability measure dzu on {T,,g), we 

shall write for simplicity dzu G Prob{T,,g) to actually mean (^-^^ dUg G 
ProhiTi,g). Moreover, for later convenience we shall restrict our attention 
to smooth probability measure with finite fc-th moments, k > 1, i.e., we 
assume, (but this is not strictly necessary as long as (E, 17) is compact), that 
the Radon-Nikodym derivative N satisfies f^[dg{x, y)]^NdIlg < 00 for some 
(and hence all) x G (E,(?), where dg{x,y) denotes the Riemannian geodesic 
distance in (E,y)). Typically we set k = 2. 
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1.1 Otto's parametrization 

As suggested by F. Otto [34], (sec also the remarkable paper [35] on which 
this section is based and from which I extracted many observations), when 
discussing probability diffusion semigroups on a Riemannian manifold (S,^) 
it can be profitable to consider each fiber Pro5(S, g) G Prob{T,) as an infinite 
dimensional manifold locally modelled over the Hilbert space completion of 
the tangent space 

(1.3) TjvPro6(S,£/) = |/i e C6(S,M), y /iiVdng = o|, 

with respect to the inner product defined, at the given Radon-Nikodym 
derivative N, by the Dirichlet form 

(1.4) iv^^Oig,N) = ^ (5^' Vfc(^V,c) NdUg , 

for any (/?, G C^(S,]R). (Recently, this matter has been discussed from 
a geometric point of view in a series of papers by J. Lott and C. Villani, 
[24, 25, 27]). Under such an identification, one can represent vectors in 
T^ProbCE, g) as the solutions of an elliptic problem naturally associated 
with the given probability measure N dUg according to 

(1.5) {h,N) e TNProb{T.,g) x Prob{J:,g) 1 — ^ ip e Cb{E,R)/R, 

where, for any given pair (h,N), the function tp is formally determined on 
the given (S,^) by the elliptic PDE 

(1.6) -V'{NVii;) = h, 

under the equivalence relation identifying any two such solutions differing 
by an additive constant. In general, such a characterization is somewhat 
heuristic, at least in the sense that its validity must be checked case by case, 
(a particularly clear and deep analysis of the whole topic is discussed in [1]). 
As we shall see, it applies in our setting, and it provides a useful framework 
for discussing the entropic aspects of the volume-normalized Ricci flow. 

There is a further aspect about the geometry of Pro6(S) which it will be 
useful to have at our disposal. First, note that the tangent space to the 
bundle ProbCE) at {g, N dUg) can be decomposed as 

(1.7) T^g^Nau,)Prob{i:) = TgUiemiT) ®Tn Prob{T,,g) , 

a decomposition which, since TZiem{T,) is contractible, extend to the whole 
tangent bundle T Prob{T,). Let F : [0, 1] 9 A 1-^ gabW be a smooth curve 
of metrics in niem{^), with Vol i^,giX)) = Vol (S, c/(A = 0)), VA G [0, 1]. 
By means of the corresponding one-parameter family of normalized volume 
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elements dUx = VollEj^^^^diigf^x) ^ Pfob{T,,g{X)), the curve F naturally 
lifts to a corresponding curve in the bundle Prob{T,), 



(1.8) 



[0,1] — ^ Prob{T) 
A ^ {gab{\),dJlx) . 



The tangent vector G T(^g(^X),imx) Prob{T,) to such a curve at any given value 
of the parameter A, say A = ,s, can be readily characterized, in analogy with 
(1.6), if we parametrize -^dHxlx^g in terms of a potential 6^ obtained as 
the solution of the elliptic equation 



d 



(1.9) g''{s)Vi (dn, VkOs) = - g^dUx 



d 



dn,5''=(A)— 5^fe(A) 



where V denotes the covariant derivative with respect to the metric gabi^)- 
Since -§)^gik{^) ^ ^p(a)^^^"T'(^)) the equation (1.9) is formulated in the 
tangent space r(g(A),dn>^) -Pro6(S), (this is the reason why we have expressed 
it in terms of the measure density dlix rather than of the corresponding 
Radon-Nikodym derivative). Note that dlix is covariantly constant over 
the corresponding (S,g(A)), 



(1.10) 



VidJlx = dIixg''\X)Vigab{\) = ^, 



(this is equivalent to the familiar formula di In \J g(X) = (5^r?,(A), where 
r"p(A) are the Christoffel symbols associated with gab{X)), thus we can 
rewrite (1.9) as A 65 = — g^''{X)-§x 9iki^)\s^ where A = 5**^(A)ViVfe denotes 
the Laplace-Beltrami operator on {Il,g{\)). 



It must be stressed that the family of potentials 0^ depends on the chosen 
curve of metrics A ^ gabW, and not only from the associated Riemann- 
ian measures A ^ ^11^. Actually, the dependence from ^11^ can be easily 
traded for the action of the diffeomorphisms group ViJ fiTi). This follows 
by observing that since we have normalized the Riemannian volume ele- 
ments dH-Xi we can apply Moser's theorem (see e.g. [4], 7.2.3) according 
to which, on a compact manifold S admitting two volume forms dji and 
dv with dfjL = fY^du, there exists a diffeomorphism : S — > S such 
that 4>*dii = dv. In our case, this implies that there exists a A-dependent 
diffeomorphism 



(1.11) 

such that 
(1.12) 



(S,5„6(A))^(S,5„5(A = 0)) 
y^^x' = ^xhj\>^) 



dUxiy') = Jicpx) dUx=oix\y^)) 
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where 
(1.13) 



^x^(^/^A) 



is the Jacobian of (p^^. In terms of the difFeomorphism (f)^, and of the metric 
9ikW, we can rewrite (1.9) as 



(1.14) 



Formahy, given a solution $5 of (1.9) and any (A-independent) smooth func- 
tion with compact support C S Co°(E,R), we have 

(1.15) / ax)du^ =[ 5''(A)ViC(x)Vfee,dn, = (c,e,), , 



where (. . . ,.. .)s is a shorthand notation for (. 



■){g(s),N=i)- According 



to (1.4) the relation (1.15) identifies {gik{\) , dUx) 1 — ^ {-§x9ikW, ©a) as the 
tangent vector to the curve A {gik{X),dI[x). The family of function Qx 
play also a fundamental role in characterizing gradient flows in the bundle 
Proh{S) when, as in our case, the inner product (. . . ,...)« varies. To discuss 
this point, let us consider an absolutely continuous curve of probability 
measures [0, 1] 9 A 1-^ dwx S Prob{Yl), covering the fiducial curve A 1-^ dUxi 
{i.e., dwx is absolutely continuous with respect to dll;^, VA G [0, 1]). Let us 
denote by ^\ the tangent vector to A dw\ defined, in analogy with (1.9), 
by the elliptic PDE 

(1.16) g'\s)Vi{dvjsVk'^s ^ 



dX 



dzui 



A=s 



In terms of the associated Radon-Nikodym derivatives , connecting the 
fiducial curve of reference measures A 1-^ dUx to the curve A 1— >• dwx, we 



have, (from the identity ^ 
d 



dvj 



A 



Ai'^dUx)), 



(1.17) 



dX 



dw\ 



dUx 



\=s 



dwg 
dYL 



Vi(*. - e,) 



which is easily seen to be interpretable in the sense 



(1.18) 



d_ 
dX 



Cix) 



( 



dzu) 



-L 



\dlix 



dlix 



X=s 

dzu, 

dn. 



dn. 



VC(x) G Co°(S). Namely, (^'a — &x) is the tangent vector, at A = s, to 
the curve of Radon-Nikodym derivatives [0, 1] 9 A (St) ^ Cb{T,,R+). 
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If we evaluate the inner product between {'^x — @x) and a generic tangent 
vector ^ G Td^Prob{'E, g{X)), then we get 



(1.19) 



(f\X)Vi{^x-Qx)yk^ 



dzu\ 



dU 



dUx 



d 



L2(dn;,) 



where ( , )L'^[d'n)^) denotes the standard L?{dllx) inner product on (S, (/(A)). 
Similarly, if we denote by GradJ^ the gradient of a smooth functional T : 
Pro6(S) — > M, with respect to the inner product (...,.. .)dro^, then we 

compute 
(1.20) 



{GradT, 
/ 5^'=(A)ViGrad.;^Vfce 



Sa 
V,; 



"'^%^'=(A) VkGradJ^ 



dHy 



dlix 



dUx 



dllx 



Note that both in (1.19) and (1.20) we have explicitly inserted the A- 
dependent metric (A) in order to make it clear that the inner product 
(...,.. .) dz^x depends from the curve of Riemannian metrics A gabW- 

When is identified with the relative entropy functional [15], 

f /E^i^fe^n, if dzu«du. 



(1.21) 



S [dzu II dUg] = < 



00 



otherwise. 



where dzu « dUg stands for absolute continuity, the relations (1.19) and 
(1.20) allow to characterize a class of flows A 1-^ -^^^ which will be important 
in what follows. We start computing Grad S[dw \\ dUg] in such a case. To 
this end, let us consider a linearization of in the direction of the generic 
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vector ^ G Tdru Prob{'E,g), i.e., 



(1.22) 



dzu , , . dw 



with ^ parametrized a ' la Otto, 
(1.23) C = -V^ 



dzu 



The corresponding hnearization of S[dw \\ dUg] in the direction ^ provides 



(1.24) 



D S[dw II dHg] o^= —Se[dw II dHg] 



€=0 



duj \ 

1 + In — V' 
s V dUgJ 

V' In^ViTdUg 

s dUg 



dzu 



V,;T 



which imphes that the gradient, with respect to (...,. . ■)d-cj\(mgj given by 

dzu 



(1.25) 



Grad S[dzu \\ dUg] = In 



dUn 



With these preUminary remarks along the way, given the flow of reference 
Riemannian metrics A i-^ gabWi let us consider a curve [0, 1] 9 A i-^ G 
C~(S,M) n Cb(E,M+) whose tangent vector (*a - ©a) is such that 



(1.26) {{^x - ex), + {GradS[dzu \\ dUg], 0^_^ = , 

G Td^Prob{T,,g{X)). Note that, according to (1.19) and (1.20), such a 
condition is equivalent, in the L'^{T,, dUx) sense, to the PDE 



(1.27) 



d_ 
dX 



dzux 



dzu 



A ik 



dUx 



g'^X) VkGradS 
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where GradS is a shorthand notation for Grad S[dw || dUg]. If we insert 
in this latter condition the expression (1-25) for GradS, we get the Fokker— 
Planck equation 

An elementary computation shows that 
(1.29) ^ S[dwx II cMa] = ((*A - Qx), S[dwx \\ dU^])^ 
Thus, if we set ^ = S[dw\ \\ dll\] in (1.26), we get 



(1.30) S[dzux II dUx] = - {GradS, GradS)d^ , 

which implies that (1.26) is the condition for the curve [0, 1] 9 A G 
C°°(S,M) n Cf,(S,IR+) to be the gradient flow of the relative entropy func- 
tional S[dwx II dll.\\, with respect to the A-varying inner product (, )cLw\dn ■ 
Note that, in the L^(i;, (inp)-sense, such a gradient flow condition implies 
that the Radon-Nikodym derivatives evolves according to the Fokker- 
Planck diffusion (1.28). 

The relation (1.26) is an elementary but important property of the evolution 
of the relative entropy functional S[dwx || ^11;^] along a fiducial curve of 
Riemannian metrics A QabW- It is ultimately related to Moser's theorem. 
To disclose the rationale underlying this latter remark, let us recall the well- 
known fact that, for a given fixed metric g, the gradient flow {dwx-idXlg)^-^^ 
of S[dwx II dlig] is the standard heat flow on {Ti,g), [43, 44, 45]. Along the 
same vein, let us observe that, along a fiducial curve of Riemannian metrics 
A ^ QabWi we can write 

f dwx , dwx 



dw\ , dw\ .„ f dwx , dUx ,„ 

m — — d\lx= 



(1.31) 51... 11^.]=/^^ in 

^ I dvjx dwx ^jj^ - / 

Jt. dnA=o dnA=o Jt. (I^\=o dnx=o 

= S[dwx\\dUx=o]- I -^^ln{J{cP^))dIlx=o, 

where we have exploited Moser's theorem in the form (1.12). Thus, the 
relative entropy functional S[dwx \\ dUx] of a diffusion process {dzux)x>=oj 
with respect to a A-varying reference measure {dIlx)x>=o, is the sum of the 
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relative entropy of {dwx))^y^Q with respect to the fixed Riemannian volume 
clement dIlx=Q, plus a forcing potential term provided by In (J((/);^)). It is 
well-known that the gradient flows of relative entropies with forcing poten- 
tials typically yield for a Fokker-Planck diffusion [44, 45]. Obviously, this 
hcTiristic explanation must be taken with care, because of the A dependence 
in the potential term lia{J{(j)x)). Nonetheless, it provides a natural frame- 
work for understanding the subtle interplay between the diffusion of prob- 
abilty measure on Riemannian manifolds evolving along a geometric flow 
A 1-^ (E,(/(A)). Indeed, a central theme of this paper is that Fokker-Planck 
dynamics has remarkable geometric properties exactly when the fiducial 
curve of reference measures A dll\ is generated by the (backward) Ricci 
flow. 



1.2 Relative entropy and the Wasserstein distance 

As we have seen above, the relative entropy functional S[dm\ \\ dUx], de- 
fined by (1.21), has a distinguished role in disclosing the interplay between 
Riemannian geometric flows and diffusion processes {dtu\)\>o. In particu- 
lar, the approach to relative equilibrium {dw\)\>o ^ dllg is often controlled 
by a logarithmic Sobolev inequality (LSI) [15, 18, 35, 43], that can be con- 
veniently expressed [15, 35], in terms of S[dwx \\ dUx], as 

(1.32) LSI{p; B) = —{I[dw \\ dUg] + B) ^ S[dw \\ dUg] > 0, 

2p 

where p > and B >0 are constants depending on the underlying geometry 
of (S,^), and 

(1.33) I[dw II (Mg] = J 

is the entropy production functional (or Fisher information [15]). In general 
we can set 5 = in (1.32), however, as will become apparent after equation 
(2.29) below, the defective form (1.32) has some notational advantages in 
our setting. Recall also that if we assume the Bakry-Emery criterion [3] 
Ric{g) — Hess{\ii > pg, where Hess{o) denotes the Hessian on (S,^), 
then LSI{p; B) holds on compact Riemannian manifolds without boundary 
[5, 8, 9, 15, 39, 31]. 

In this section we describe some of the basic properties of S[dw\ \\ dli\]^ 
[1], [35] that we shall need later on. Let us start by recalling that Jensen's 
inequality implies that S[dw \\ dlig] € [0,-|-oo], (this can also be checked 
directly by noticing that Sldzu \\ dUg] can be rewritten [35] as the integral of 



Vln 



dw 



dli„ 



dw 
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the non-negative function — Ij + 1). Moreover, as a function of 

the probabihty measures dw and dUg, the functional S[dvc \\ dUg] is convex 

and lower semicontinuous in the weak topology on Prob{T,, g), and S[dw \\ 
dUg] = iff dw = dUg. It can be characterized [15] by the variational 
formula 

(1.34) S[dvu II dUg] = 

= sup y^fdw- exp[/ ] dUg : / G (7^(2; M)| . 

Roughly speaking S[dtu \\ dUg] provides the rate functional for the large de- 
viation principle [15] controlling how deviant is the distribution of dw with 
respect to the reference measure dUg. In particular one has [15] (Pinsker's 
inequality) 



(1.35) S[dm II dng] > i \\dw - dUgWl^^ , 

where we have introduced the total variation norm on Prob{T,, g) defined by 
(1-36) ||dtu-dn,||^„^ = 

= sup I / fdw- [ fdUg : /gC5(S;R)1, 

11/11, <i I J 

with 11/11^ < 1 the uniform norm on C(,(S;R). This is a particular (and 
elementary) case of transportation inequalities involving S[dw || dUg] and 
the notion of Wasserstcin distance between probability measures [1], [44]. 
Let us recall that for dwi, dw2 G Proh{Ti,g)^ we define the Wasserstein 
distance of order s between dwi and dw2 as 

(1.37) DY{dvJi,dw2)= inf (// c/(a;,?/)V(droi dros) 

where H{dwi, dw2) C Prob{{'E,g) x (S,^)) denotes the set of probability 
measures on S x S with marginals dzui and dw2, i.e., such that tt{U x 
S) = dzui(U) and 7r(S x U) = dw2{U) for any measurable set C S; 
{H{dwi, dw2) is often called the set of couplings between dzui and dw2). 
Note that by Kantorovich-Rubinstein duality, we have that {dw\,d'UJ2) 
= \\dwi — dw2\\y(^^- Intuitively, DY{dwi,dw2) represents, as we consider 
all possible couplings between the measures dw\ and dw2, the minimal cost 
needed to transport dwi into dw2 provided that the cost to transport the 
point X into the point y is given by d{x,yY. The distance {dwi,dw2) 
metrizes Prob{Yl,g) turning it into a geodesic space. 

The pair {Prob{T:, g),D^) has recently drawn attention [26, 28, 33, 40, 46] 
as an appropriate setting for extending the notion of Ricci curvature to 
general metric spaces. In this connection, a particularly elegant approach 
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has been introduced in [46], by relating the (ii'-) convexity of the entropy 
functional S[dzu \\ dUg] to Ricci curvature lower bounds. Explicitly, ([46], 
Th.l), if 7 : [0, 1] 9 A ^ dwx G Prob{T.,g) is a (Pro6(S, 5), L»^)-geodesic, 
(note that, typically, 7 is not the linear interpolation of dmx and dUg), then 
a lower bound on Ric{g) is equivalent to the if-convexity of S[dw\ \\ dllg] 
along any such geodesic 7, i.e. 

(1.38) S[dwx II cfflg] < (1 - \)S[dwQ \\ dUg] + \S[dwi \\ dllg]- 

-^\{l-\)[DY{dwo,dwi)f 

iff Ric{g) > K, with K E M. Such a result again points to transporta- 
tion inequalities [6, 7, 23, 44]. For our purposes, it is sufficient to recall 
the (simpler) bound ([7] case 5 of Th.l) which always holds on compact 
Riemannian manifolds 

(1.39) DY{dw,dUg) < 2^sdiam{^,g) S[dw \\ dllg]^s ^ 

y dw G Prob{T,,g), and where diamiTi,g) = sup{(ig(x, y); a;, y G (S,^)} 
denotes the diameter of (S,(?). It should be stressed that when, as in 
our case, one has a family of Riemannian manifolds A (S,^(A)), such 
a Talagrand-like inequality is effective as long as one has some uniform 
control on diam{T,, g{X)). 

A rather direct connection between Otto's description of Probes, g), dis- 
cussed in the previous paragraph, and the (quadratic) Wasserstein distance 
DY{dwi,dw2) has been stressed by Otto and Villani [35], (see also [1] for 
a more general setting), by relating {dzui, dw2) to the geodesic distance 
associated with the inner product (1.4). This relation has been analyzed in 
detail by Lott [25], who has proved the following, (see [25], Prop. 3.3, and 
Prop. 4.24), 

Theorem 1.1. (Lott) Let w : [0, 1] 9 A dw\ G Prob{'E,g) be a smooth 
curve in Prob{T,,g) with tangent vector ^x, defined for any fixed X = s by 
the elliptic PDE 



(1.40) {dws Vk^s) = - -^dwx 



\=s 



and such that V^'a 0, VA G [0,1], (i.e., the curve is immersed). Let 
= Ao < Ai < . . . < Aj = 1 be a partition of [0, 1], and let 

J 

(1.41) L{w) = sup sup V {dwx.-,, dwx.) , 

Jen o=Ao<Ai<...<Aj=i ~{ 
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denote the length of the curve w in the Wasserstein space (^Prob{Tt, g), D^)- 
Then 

(1.42) Liw) = [\{^x,^x),^^f dX 

(^J^V'^x^i^xdzuxY dX. 

Moreover the curve w is a geodesic in (^Prob{T,, g), D^) if its tangent vector 
satisfies the Hamilton- Jacobi equation 

(1.43) ^ + L^ = 0, 

modulo the addition of a spatially-constant function to "^x ■ 



It is important to discuss if tlic relation (1.42) of Tlicorcm 1.1 still holds 
if the metric gab, defining the reference measure and the diffusion operator 
(1.40) in Prob{'E,g), is replaced with a curve of (volume preserving) fiducial 
metrics [0, 1] 9 A i-^ dabWi with gab{X) uniformly bounded above and below 
for < A < 1. The potentially delicate issue concerns the characterization, 
along the given fiducial curve of metrics A ^ gabW, of the Wasserstein 
distance (the distance associated with the inner product (1.4) extends in 
an obvious manner along A i— > gabi^))- Indeed, the Wasserstein distance is 
usually defined in a fixed Prob{T,^ g), (see (1.37)), with a cost function d{ , )^ 
provided by a fixed metric tensor gab- To characterize its extension to curves 
in ProbiTi), covering A ^ gab{X), let us assume, in line with the boundcdncss 
hypotheses on gabW, that there are constants C > 0, (typically depending 
only on the dimension of (T,,g{X))), and M > 0, (depending on a uniform 
bound on the geometry of {'E,g{X))), such that 

(1.44) e-CM(A,-A,) ^^^(^^y) < ^^^(^^y) < gCM(A,-A,) ^^^(^^y) 

for any points x and y in S, and any A^, Xj G [0,1], with Afc < Xj, (in 
the Ricci fiow case, (1.44) holds whenever the Ricci curvature Ric{g{X)) of 
(i;,5(A)) is bounded by M, \Ric{g{X))\ < M, [21]). For any A G [0, 1], 
denote by 

(1-45) <^A,A, : ^x, ■ 

the diffeomorphism defined by Moser's theorem, (see (1.11)), such that 
dUx = J{(l)x^Xk) d^Xk, where J{'^x,Xk) Jacobian of ^x,Xk- with 

the above hypotheses on the metric Qabi^), we assume that there arc con- 
stants C", M' > 0, (again depending from a uniform bound on the geometry 
of {T,,g{X))), such that 

(1.46) e-^'^'(^-^) J(0,,,J < J(<^,,,J < e^'^'(^^-^) J(<^,,,J 
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for any A, A G [0, 1], with A < A, (in the Ricci flow case, M' typically is an 
upper bound to the scalar curvature). Let 

(1-47) <t>^l, : Sa, , 

be the inverse of 4>\\^.- Denote by dVix. and di^\^, A^ < Xj, two probabil- 
ity measures belonging to the spaces (Pro6(S;^^.)) and {ProbCExf,)), respec- 
tively. Their pull-backs under the above diffeomorphism, (0^.^;^)* dClx^ and 

{4>x^x)* ^^Afc) both belong to the probability space {Prob(Tix)), and we can 
define their Wasserstein distance at A according to 

(1.48) [irxlxTd^x.^ irxlxTd^x-, a) = 

1 

where Hx = H{{(l)^^ -^y dilx^^ ("/"a -^a)* ^^-'^j ) appropriate space of 

couplings, (see (1.37)), and where dx{ , ) is the Riemannian distance in 
(S,5(A)). Since the cost function dx{ , )^ and the Jacobians J{^xXu) 
both uniformly bounded in A, (see (1.44), (1.46)), the quadratic Wasserstein 
distance [X) defined by (1.48) depends smoothly from A. In particular, 
we have 

(1.49) e-^"^"(^-^)Df (A) < Df{X) < e^" ^" ^^-'^^ (X) , 

for any A, A G [0, 1], with A < A, and for costants C", M" depending from 
the constants in (1.44) and (1.46). 

With these preliminary remarks along the way, let us assume that the abso- 
lutely continuous curve of probability measures w : [0,1] 3 X i-^ dwx G 
Pro6(E), introduced in Theorem 1.1, covers a fiducial curve of metrics 
X I— > gab{X). As above, let = Aq < Ai < . . . < Aj = 1 be a partition 
of [0, 1]. The uniform bound (1.44) implies that we can choose e > small 
enough such that 

(1.50) \dx,ix,y)-dx,_,ix,y)\< 

with ^e'^^i^j-^j-i) - l\ << 1, whenever Xj - Xj-i < e. Let dwxj G 
ProhiT^Xj) and dwxj_^ G Proh{T,Xj-]) the pair of probability measures corre- 
sponding to the values Xj and Aj_i of A. According to (1.48), we can evaluate 

the quadratic Wasserstein distance (^wxj_i, ('5^a/_i a)* ^^-'^j' ^j-i^ be- 
tween dwxj G Prob{T,Xj) and d'UJXj_^ G Pro6(S;v^._j), at A = Aj_i. Thus, we 
define the Wasserstein length of the curve : [0, 1] 9 A i-^- dwx G Prob(F,), 



,CM(A,-A,_i) _ ^ 
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covering a fiducial curve of metrics A gab{^)j according to 
(1.51) 

J 

^fl(A)M = sup sup Yl (d^Xj-i, i<PxL,xX d^^i' ^j-i) ■ 

JeN Ao<Ai<...<Aj ~[ ^ ' ' ' 

Note that if Aj_i < A < Aj is a refinement of the interval [Aj_i, Aj], then 
from the triangular inequality for (1.48) and the bound (1.49) we get 

(1.52) (dn7,^_,, (<A^^^,,p* dwy^- A,_i) < 
< i^f (dt^A._,, {<t>l]_J* dvj-^, A,_i) + 
+i^2^ ((</.-^^,,)*d^X, (0i„A,)*rf^A,; A,-i) 

+^C"M"(A-A,_,)^V^(^^_, (0rl^J*d.^,^; a) , 

which implies that (1.51) is well-behaved under refinements of the partition 
= Ao < Ai < ... < Aj = 1. 

As observed above, the length of the curve : [0, 1] 9 A dvo\ G Prob{T,), 
evaluated with respect to the inner product (1.4), extends naturally to the 
case when the curve in question covers a fiducial curve of metrics A gabW- 
It is sufficient to replace |V *Apin (1-42) with g^''{X) Vi^x Vk^x- It follows 
that one can easily adapt the proof of proposition 3.3 in [25] to conclude 
that the length of za with respect to the inner product (1.4) equals the length 
in the Wasserstein sense, i. e. , 

(1.53) Lg^^)iw) = ^ g"'{X)Vi^xVk'^xdzux^ ' dX , 

which extends the relation (1.42) to the more general case considered here. 

It must be noted that a similar extension of the Hamilton-Jacobi condition 
(1.43) for characterizing Wasserstein geodesies curves A dwx, over a 
fiducial A i-^ gab{X), is quite a non-trivial problem which (to the best of my 
knowledge) still wait for a solution. We shall comment on this point in the 
concluding part of the paper and suggest a possible strategy for approaching 
it. 
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Perelman's coupling for the volume-normalized Ricci flow 



To put the above probabilistic remarks in perspective we outline Perelman's 
characterization of the dynamics of the coupling between Ricci flow theory 
and scale-dependent probability measures [36] . Let us consider the volume 
normalized Ricci flow (3 i— > gab{P), < l3 < T [12, 19] associated with a 
metric gab on a three-dimensional manifold S 

f m9ab{P) = -2RabW) + lgab{P){Rmi:„ 
(2.1) \ 

. 9ab{P = 0) = gab, 

where Rab{P) denotes the components of the Ricci tensor of gab{P), and 



(2.2) 



[Vol{J:,gabm] 

is the averaged scalar curvature with respect to the Riemannian measure 
dHg(^l3) defined by gabif^)- 

One basic idea in Perelman's approach [36] is to consider, along the solution 
OabiP) of (2.1), a /3-dependent mapping 

(2.3) :M-^C°°(S,s,M) 

where C°°(S^,M) denotes the space of smooth functions on S^. In terms of 
fp one constructs on Hp the /3-dependent measure 



(2.4) 



dtp(/?) = (47rT(/3))-te-/(^')d/i^(5), 



where /3 i — > r(/3) € M"*" is a scale parameter chosen in such a way as to 
normalize dzu{f3) according to the so-called Perelman's coupling : 



(2.5) 



dw{P) = (47rr(/3))-i / e-f^f'Ufig^f,^ 



1. 



It is easily verified that (2.5) is preserved in form along the Ricci flow (2.1), 
and 



(2.6) 



_d_ 
dp 



(47rr(/?))"t / e-f^f^U, 



0, 



if the mapping and the scale parameter t(/3) arc evolved backward in 
time (3 G (/3*, 0) according to the coupled flows defined by 



(2.7) 



To, 
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where Ag(^) is the Laplacian with respect to the metric gab{(3), and /o, tq 
are given (final) data, (backward /3-evolution is required in order to have a 
well-posed parabolic initial value problem for (2.7)). 



If wc assume that there exist constants (depending on /?) and Cf, 

\Cp^\ < oo, such that 



(2.8) 



Jo 



then backward integration of the t(/?) equation in (2.7) along a given Ricci 
flow metric /? i — > g{P), P € [0, T), provides 

ft 



(2.9) 



r{t) 



e 3 



ro + 



where for any chosen final scale /3* € [0, T), we have set t = (3* — [3 and tq 
= T(i = 0). In terms of the adimensional variable we can equivalently 
write 

(2.10) 



1 + 



t 

r 

Jo 



63 



Whereas, in terms of the forward 



(2.11) 



r(/3) = r" 63 



2^0 



1_ e-'s^°foWs))ds^^ 
Jo 



where = r(/3 = 0). Note that the scale parameter r(i) is non-decreasing 
with t, along the backward Ricci flow, i.e. ^T(t) > 0, as long as we have 
1 - § {R{t))j^ T{t) > 0. The geometric flow 



(2.12) 



d_ 

< dt 



^QahiP) = -2Rab{f3) + yabmRW)h,, QahiP = 0) = Qab 

ift = \it)ft - V'ftVift + R{t) - lT{t)-\ fit = 0) = /o 

Tit) = 1-1 (i?(t))s,(i) r(t). Tit = 0) = TO, 



defined by the forward volume-preserving Ricci flow (2.1), /3 i-^ gabiP), 
< (3 < (3* , together with the backward heat and scale equations (2.7), 
t '-^ {ft,T{t)),t = P* — (3, characterizes the Hamilton-Perelman (volume- 
normalized) flow describing the coupling between the Ricci flow and the 
scale factorization (2.5) of the probability measure dw{t). Note that one 
can equivalently consider the system obtained from (2.12) by the pull-back 
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action, d la DeTurck [14], associated with the family of /3-dependent diffeo- 
morphisms ip : M ^ M generated by the gradient vector field V In terms 
of the pull-backs g*i^ = {^p*g)ab and /* = (<^*/) of the metric gab{P) and of 
the function fp, one can write, (see e.g., [30] for the detailed computation 
in the case of the standard Ricci flow). 



(2.13) 



' = -2i?:,(/3) + 2V„V,/* + |5:,(/3)(i?*(/3))s,, 

ft* = \^it)ft* + R*{t) - U^*{t))-\ 



d_ 

at' 



I l^*w = i-i(^*(*))E.(t)r*(*), 



where all diferential operators refer to the pull-back metric 5*^. 

It must be also stressed that the equations defined by (2.1) and (2.7) are 

based on the standard volume preserving Ricci flow and accordingly differ 
from the flows rj 1— > gab{^)j V ^ "^l^)) and rj ^ f^, discussed by Perelman 
[36], 



( 9 



(2.14) 

together with 
(2.15) 



. 9ab{ri = 0) = gab, 



-1. 



The flows (2.14), (2.15) and the ones defined by (2.7) are related by the 
usual ry-dependcnt homothctic rescaling [19, 2, 12], which maps ^^a6(^) = 

—2Rabii]) to the volume-normalized Ricci fiow (2.1), i.e., 



(2.16) 

(2.17) 

and 

(2.18) 



9ab{(3{ri)) 

m = 
rim) 



(r,=0) 



/e '(/(>;=()) 



9abiv), 

ds, 



in) 



T{r)). 
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The volume normalization makes particularly clear that, given an initial 
value To = T{t = 0), the dynamics (2.10) of the scale r(t) depends only 
on the underlying Ricci flow metric P i— > gab{P), and not on the backward 
evolution of the function ft. Thus, the localization properties of the proba- 
bility measure dzu(t), (see (2.4)) are controlled by the entropy-like quantity 
that one can form with r(t) and those scalar functionals of ft which have 
the dimension of an inverse square length. The only two such objects, of 
geometric origin on 1 1— *■ (S, g{t)), are the dcc7(t)-expectation values of | V/tp 
and of the scalar curvature R{t). This latter observation immediately bring 
us to discuss Perelman's shrinker entropy. 



2.1 The shrinker entropy 

The remarkable fact is that with (2.14) and (2.15) one can associate Perel- 
man's shrinker entropy W[g{ri); fr^,T{ri)], defined by [36] 

(2.19) W[g; f^^r] = / \t (\Vfr,f + R{ri)) + /, - sl ^ " 3 d^i^^^y 

The basic property of W^{r\)\ fr],T^] and of the related functional F[g; f] = 
Jj.{R{r]) +\V f\'^)e~f d/jLg, is that it is monotonically nondecreasing along 
(2.15) and it provides a gradient-like structure to the Hamilton-Perelman 
Ricci flow (2.12, 2.13), (however, this latter interpretation must be taken 
with care in the probabilistic framework discussed here since, as remarked, 
the geometric evolution of the metric and of f{r]) are backward-conjugated). 
Explicitly, one computes (see [22] for a very informative analysis) 



(2.20) = I 2t 



Ric + Hess - ^g 



2 



3 d^n, 



where |...|^ is the squared ^(77)-norm. Defining X(g,T) = inf/^ W[g{r)); fr,], 
where the inf is taken over all normalized /^^ one shows that inf^r^o ^(9^^) 
is actually attained and is nondecreasing along the Ricci flow. In particular, 
X{g,T) < for small r, and ^ as r \ 0, for any g on S. This basic prop- 
erty allows to probe quite effectively the geometry of {'E,g{ri)) by showing 
that the only shrinking Ricci flow solitons are the gradient solitons [36] . 

Since the equations for or f^ have the same scale invariant structure, the 
shrinker entropy and its evolution extend, in an obvious way, to the volume 
normalized Hamilton-Perelman flow (2.7), viz. 



MAURO CARFORA 



23 



(2.21) W[g-fp,T] = 



(2.22) 



dp 



W 



2t 



r(^|V/^r + i?(/3))+/;3-3 
Ric + Hess fp 



(47rT(/3))2 



2t 



2 



dn„. 



What is more interesting to note is that if we introduce the normaHzed 
Riemannian measure 

(2.23) (mf, = Vol[Ep]-Ufig^f,) 

associated with the (volume preserving) Ricci flow, then the W^-functional 
(2.21) can be be equivalcntly written as 

(2.24) Wigipyjf,] = T [l[dw{P) II dUf,] + {Rma^ip) 



-S[dw{p) II dnp]+\vL FoZ(S)(47rr(/3))-2 



3, 



where 
(2.25) 



{R{f3)) 



dro(/3) 



R{l3)dzu{l3), 



is the average scalar curvature with respect to dzu{(3), and 



(2.26) 



I[dzu{P) II dnp] = 



Vln 



dw{(3) 



dUf 



dw{P), 



(2.27) S[dw{P) II dUg] = I ln^^^dm{(3), 

respectively denote the entropy production functional and the relative en- 
tropy associated with the pair of probability measures {d'w{(3),dllp). Note 

3 

that the factor VoliT.) (47rr(/3))~2 is generated by the normalization of 
//3 to the probability measure density dxu{P)/dIli3, and one may equiv- 
alcntly write 

(2.28) S[dw{P) II dllp] - In [yo/(S)(47rr(/?))-i = 

= -Fo/(S)(47rr(/3))-t £ /^je"^" dUp . 

Note also that we can rewrite W[g{P); fp] in terms of the defective LSI 
functional (1.32), i.e. 



(2.29) W[g{l3);ff,] = LSI[{2Tm-'; (i?(/?)),^(^)] + In 



Vol{^) 
(47rT(/3))i 
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Whereas this is rather trivial consequence of the fact that the standard 
Gaussian logarithmic Sobolev inequality [18, 36] lies at the origin of the 
definition (2.19) of W[g{f5); fis], it also indicates that W[g{P); fj^] is not an 
entropy, but rather an entropy-balance functional controlling the rate of 
variation of S[dvj{l3) || dUfs] along the Ricci flow, and hence the localization 
properties of dw{(3). To prove this latter remark, let us consider, as in the 
above analysis, /3 i — > gabiP), P £ [0,r), with t = p* - (5, (3* € [0,T). In 
the backward direction, along the given volume-normalized Ricci flow, we 
have 

(2.30) -cMt = [R{t) - (i?(t))sj (Mt, 

moreover, Perelman's condition (2.7) yields the conjugate heat equation 



(2-31) ^ 



where 



d_ 
di 



dzu(t) 



dn. 



A 



dw{t) 
dlit 



^ [R{t) - (i?(i)).J , 



(2.32) 



dw{t) 
dllt 



(47rr(i))-2 l^o/[Et]e" 



-/(*) 



Since dllt is covariantly constant with respect to the Levi-Civita connection 
V associated with g{t), we can exploit (2.30) and write (2.31) as the (non- 
uniformly parabolic) probability diffusion {dw{t))t>Q PDE, 



(2.33) 



A,(,) {dw{t)) ,t = P*-(3 

dzu{t = 0) = dwQ. 
Prom (2.30) and (2.33), we get 



(2.34) 



f^S[dw{t) II dUt] = 



[R{t) - {R{t))] dw{t) 



= -I[dw{t) II dUt] - (i?(i)),^(,) + {R{t))^^ 



On the other hand, according to the evolution (2.12) of the scale parameter 
T{t), we can write 



(2.35) 



= 3 1 1 (4^r(t))i 

2^ dt yo/(s) 
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where, in the last line, we have normalized (47rr(t))2 to FoZ(S) for dimen- 
sional reasons, (and wc have introduced the factor 47r for later convenience). 
Inserting (2.36) in (2.34), we get 



(2.36) 



S[dzu{t) II dn*] +ln 



(4^r(t))i 



FoZ(S) 

I[dw{t) II dUt] + (i?(i))rf^(,) 



-(47rr(t))-iFb,/] + ^r(i)-i 



■rity 



where 
(2.37) 



F[g,f] = l^{R{t) +1V/1 



-^d 



is the standard Perelman functional associated with the volume-preserving 
flow (2.12). (I wish to thank Alessio Figalli for suggesting that a relation of 
this type should hold; the fact that the entropy functional F[g;f], for the 
standard un-normalized, Ricci flow, is the time-derivative of the entropy- 
like quantity Jj. /t e~'^* d/i^^j) has been noticed also in [30]). The relation 
(2.36) characterizes also the shrinker entropy W[g{/3); fp] as the variation, 
along the backward Ricci flow, of the functional 



(2.38) g[dw{t), dUt, T{t)] = S[d-aj{t) || dUt] + In 

= -FoZ(S)(47rr(t))-i ^ /* e'^* dn* + ^ . 

Indeed, (2.36) can be rewritten as 
d 



Vol{T,) 



3 

+ 2 



(2.39) 



= -W[g{py,M - - {R{t))^^ T{t)g . 



The fact that (2.39) has the same formal structure of the evolution equation 
(2.12) of the scale parameter r(t) further confirms that W[g{P); fp] controls, 
via the effective scale Teff{t) = T{t)g, the localization properties of the 
measure dw{t). 

It is important to remark that, according to the relation (2.34), the relative 
entropy S[dw{t) \\ dUt] is not monotonic along the Hamilton-Perelman flow. 
This is related to the fact that the evolution 1 1— > ^^j^, described by (2.31), 
is not a gradient flow for S[d'cu{t) \\ dUt]. The deviation from being gradient- 
like are due to the presence of the curvature fluctuation term R{t) — (-R(t))j. 
in (2.31), (under dti7(t)-cxpectation, this term yields {R{t))^^^ — (/^(t))^^. 
According to the analysis presented in section 1.1, a possible strategy for 
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compensating such a fluctuation term is to trade R{t) — {R{t))j.^ for a {t- 
dependent) potential, so as to transform the heat diffusion {duj{t))t>o into a 
Fokker-Planck diffusion. However, before discussing such a transformation 
in detail, there is still an elementary but interesting property of (2.34) that 
we would like to point out and concerning the role and the monotonicity 
properties of the dw{t) average {R{t))^^f of the scalar curvature. 

2.2 A renormalized curvature entropy 

To begin with, let us observe that the basic relation (2.36) becomes partic- 
ularly simple if we rescale the parameter r(t) according to 



(2.40) f{t) = TO e" 

Note that f{t) satisfies 
(2.41) 



Tit) 



1 + jT0f^{R{s))ds^^ 



|f(t) = -I (Rit))^^ Tit) . 



It is easily verified that in terms of f{t) we can rewrite (2.36) as 



(2.42) 

Since 
(2.43) 



and 
(2.44) 



I[dzuit) II (Mt] + (i?(i))rf^(,) 



S[dw{t) II dHt] + In 
dw{t) In 



(47rf(t))2 
yoZ(S) 

(47rf(t))i \ dwjt) 
FoZ(E) I dUt 



dw{t) 



Vln 



(47rf(t))^ \ dw{t) 
FoZ(S) j dUt 



I[dw{t) II dHt] 



it follows that {R{t))^^(^f.^ represents the obstruction to {dw{t))t>o for being 
a gradient -like flow. It is indeed the LSI defective parameter (R)^^ ap- 
pearing in the shrinker entropy. One can easily check that {R{t))dw{t) 
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monotonic along the backward Ricci flow, however f(t) {R{t))^^^^-^ turns out 
to be weakly-monotonic, and we have the following 

Theorem 2.1. For a given Ricci flow metric [3 i — > giP), P € [0, T), and 
for any chosen (3* £ [0, T), let t i — > dw{t), and t \ — > T{t), t = (3* — (3, be 
the solutions of (2.33) and (2.10) corresponding to the initial data dwo and 
To, respectively. Then, along the backward Ricci flow, we have 



(2.45) 
In particular. 



d_ 

at 



m{m))d^{t) =-2r(t) / dw{t)\Ric{t)\' <0. 



(2.46) roe-§-°/o^"<^W>-.'^^(i?(t)),^(,) 
is nonincreasing as a function oftE [0, /?*). 

Proof. Let us recall that along the (volume-normalized) Ricci flow /3 i— > 
gikiP), l3€iO,T) we have [12] 

(2.47) = A,(^)i?(/3) + 2\Rrcm' + ^i?(^)(i?(/3) - (i?(/3))s,), 

where Ric denotes the trace-free part Rab — \gahR of the Ricci tensor. A 
direct computation exploiting (2.47) provides 

(2.48) ^Jjw{t)f{t)R{t) = ~f{t){R{t))^, jjw{t)R{t)+ 

-T(t) [ \it)R{t)dvo{t) - 2f(t) f dw{t) Ric{t) ^ - 

Jt, jt, 

~f{t) dw{t)R{t) {R{t) - (i?(t))sj + 

+fit) J^Rit)Ag^t)idwit)). 

Since Ay(^t)R{t)d-cu{t) = R{t)Ag^i.){dw{t)) and \Ric\'^ + lR{t)^ = \Ric\^, 
we get the stated result. | 



The monotonicity of (2.46) immediately implies that 
(2.49) (i2(ti)),^(,,) > {R{t2))d^^t,) exp 



2 rt2\T0 

-To / {R{s))^^ds 

^ Jtl\TO 
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for any t2 > ti- Thus, if < 0, the measure {dw{t)}t>ti will 

diffuse in a region of (T,,g{t)) where R{t) < 0, (regardless of the sign of 
the overall average {R{t))j^J. Since t parametrizes the backward flow, this 
remark implies that regions of negative scalar curvature result from the 
Ricci flow /3-cvolution of regions with negative scalar curvature (in other 
words localized positive scalar curvature cannot evolve into negative scalar 
curvature under the Ricci flow). This is another manifestation of the fact 
that the flow prefers positive scalar curvature. 

More generally, we can rewrite the evolution equation in Theorem 2.1 as 



(2.50) 



di 



which, by factorizing the Ricci tensor in its trace-free and trace part, and 
by adding and subtracting the term |r(t) {R{t))'^^^^^, yields 

d 



(2.51) 



dt 



-2f{t) ^ Rlc(t) 



1 dzu{t) 



dw[t) 

\m m))Lit) 



Since the terms on the right hand side of this expression are all non-negative, 
we get (after dividing and multiplying by f(t) > 0) 



(2.52) 



dt 



\t) T{t)m))a^it) 



This integrates to the Harnack-type inequality 



(2.53) (i?(i))rf^(i) 



< 



where < ( < t, and i?o = R{t = 0). Since {Ro)d-mo ^^sults from the /3- 
evolution of R in the regions localized by the measure dw{t), the above 
estimate allows, as we have seen above, to compare scalar curvature at 
different times in different regions along the backward Ricci flow. 



3 Ricci flow and Fokker-Planck diffusion on Pro6(S) 

According to (2.30) and (2.34), the curvature fluctuation term i?(t) — (i?(t)) 
drives the dynamics of the Riemannian measure dHt, and obstructs the 
gradient -like nature of Perelman's diffusion {dw{t))t>o. Such a behavior is 
in line with the geometric properties of Otto's parametrization of diffusion 
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processes, along a fiducial curve of metrics, discussed in section 1.1. In this 
connection, let us recall that the curve in the space of Riemannian metrics 

(3.1) 7^^em(S) — > niem{T) 

(S,ff) ^ (S,5(/3)), 

defined by the Ricci flow (2.1), is natural in the geometrical sense since it 
is r'i//(S) -cquivariant, and always admits a solution (3 ^ {Ti^g{f3)), in a 
maximal interval < /? < Tq, for some Tq < oo, (if such a Tq is finite, 
the flow necessarily develops [19] a curvature singularity as [5 /" Tq, i.e., 
lim^^Tg [sup^-gs \Rm{x,(3)\] = oo, where Rm{f3) = {Rlj^iP)) is the Rie- 
mann tensor of (S, g{j3))). Such a geometrical naturality is the basic reason 
why, as we shall sec, the Ricci flow provides a natural class of fiducial curve 
of metrics along which Otto's parametrization turns out to be particularly 
effective. 



3.1 A potential for scalar curvature fluctuations 

Let us start by observing that the curve of Riemannian mcasTircs t h->- dUt 
can be formally considered as the lift to the bundle Prob{T,) of the curve 
in TZiemiYi) defined by the backward Ricci flow [0,/3*] 3 i h-> gab{t). In this 
way, one characterizes a fiducial curve of reference measures in the bundle 
Prob{T,), and for each t £ [0, l3*] we can naturally describe the corresponding 
fiber of Pro6(S) over gab{t), as 
(3.2) 

Probi^t) = Prob{T.,g{t)) = i^N dllt : N £ C6(S,R+), J N dUt = . 

To fully exploit such a description we need to characterize the tangent 

vector to the fiducial curve t dUf. Let us consider a generic value 
of the parameter t, say t = s. Since Jj,^ ^dllt\t=s = 0, we have that 
■^dllt\t=s G TdUtP'i'oK^)- According to (1.6), and in analogy with (1.9) we 
parametrize ■^dllt\t=s in terms of a scalar curvature-fluctuations potential 
$s obtained as the solution of the elliptic equation 

(3.3) 5''(s)Vi (dn, Vfe$,) = - ^^dUt 



t=s 



where ■^dllt\t=s is given by (2.30) for each given t = s, i.e., (again exploiting 
the covariant constancy of dUt), 



(3.4) 



Ag^,^<^s = -iR{s)-{R{s))) 
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Formally, given a solution of (3.4) and any (t-independent) smooth func- 
tion with compact support ( £ C^(S,M), we can write 

(3.5) I / Cdllt= [ C§-/^t= [ VXVi^tdnt = {C,^t)an,- 

According to (1.4) the relation (3.5) identifies dUt i — > $t as the tangent 
vector to the curve t — > dUt and defines the curvature (fluctuation) potential 
in which a probability density, evolving along a Ricci flow manifold, diffuses. 

Since j^dlis {R{s) - {R{s))) = 0, and 

(3.6) / {R{t) - {R{t))^,)^t dUt = [ |V$tp dUt 

we get that, as long as the (volume-normalized) Ricci flow exists, equation 

(3.4) admits a solution unique up to constants. The L^((E, dfl^), M) norm 
of {R{s) - {R{s))) is given by {R^{s)) - {R{s)f , thus is in the Sobolev 
space if2((5], dlls), R) if the mean square fluctuations in the scalar curvature 
are bounded. More generally, we know (see e.g. [12]) that if /3 i— gabiP) is a 
solution of the Ricci flow equation (for which the weak maximum principle 
holds), then bounds on the curvature (and its derivatives) of the initial 
metric induce a priori bounds on all derivatives \ V"^R{x, s) \ for a sufHciently 
short time. Thus, for any given t = s for which the Ricci flow is non-singular, 
we can assume that {R{s) — {R{s))) is C°°(i;,M) and by elliptic regularity 
we get that € C°^(S,]R). Along the same lines, we also have 

Lemma 3.1. For any $t G C^(S,M) solution of (3.4) the following relation 
holds 

(3.7) / \Hess^tf dnt+ [ R'''{t)Vi^t^k^tdUt = 



= {R{tY)^,-{Rm^. 

If the Ricci curvature of {Yltj gii)) is positive, then 



(3.8) / 



v^t r dUt < - 



2 {R{trh^ - {R{t))i 



3 Kt 



where Kt > is the lower bound of Ric{t). Moreover, if Ly(^t){dllt) denotes 
the (quadratic) Wasserstein length of the curve 1 1— > dUt, then we have 



1 



(3.9) 



Proof. These results are elementary consequences of the Ricci commutation 
relation VjVj / -ViV^Vjf= RijV^ f, valid for any / G C3(S,M). In 
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particular, for $t G C3(S,R), consider the expression 2V*$tViAg(()$t. The 
Bochner-Wcitzcnbock formula provides 

(3.10) 2V*$tViA^(t)$t = Ag(,) |V$t|' - 2 \Hess^tf - '2R'\t) Vi^t^k^u 
pointwise. Thus 

(3.11) -I \Hess^tf dUt- [ R'\t)Vi^t'7k^tdUt = 

= / V^$tV,A,(,)$t dUt = [ {R{t) - (i?(0)sJAg(,)$t dUt = 

where we have integrated by parts and exploited (3.4). Since 

(3.12) / \Hess<^tf d^t>\j \\(t) dUt = \{{R^),:, - 
we get from (3.7) 

(3.13) ^ {{R{t?)i:, - {R{t))l,) > I R'\t)Vi^t^k^tdnu 

which, if the Ricci curvature has a positive lower bound K^, yields (3.8). 
Note that, since positive Ricci curvature is preserved along the Ricci flow, 
the bound (3.8) holds for every t. If Lg(t^{dlit) is the (quadratic) Wasser- 
stein length of the curve t ^ dlit, defined according to (1.51), then the 
identification (1.53) and the bound (3.8) imply 

(3.14) Lg^t){dnt) = f ([ \S/ ^t\' duX dt 



as stated. 



Note that equation (3.4) has a familiar counterpart in the Ricci flow theory 
for surfaces [20], (see also [12]). It arises also in Kahler geometry where 
it provides the relation between the Kahler Ricci potential and the scalar 
curvature. Moreover, if we consider a gradient Ricci soliton Ric{g(t)) = 
Hess i.e., a fixed point of the flow obtained by quotienting the space of 
metrics under diffcomorphisms and scalings, then clearly $t = — ^t- In the 
general case, {viz., when (S,^) is neither Kahler or a gradient soliton), we 
can still obtain a geometrical characterization of We start by deriving 
an asymptotic expression for valid in local geodesic coordinates (LGC), 
{x^}lgCi ^-iiy given fixed point p G S^. Let us denote by r{x) = d{p, x) the 
Lipschitz function providing the distance from p to x. For x ^ Cut{p), the 
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cut locus of p, we set = ru^ , with coordinates on the unit sphere C 
TpT,s- The puh-back of the Riemannian measure dUs under the exponential 
mapping expp : TpTig ^s, provides the familiar asymptotics in geodesic 
polar coordinates 



(3.15) 



where dllEud. is the standard Euclidean volume element in polar coordinates 
in TpUg, and Rikip) are the components of the Ricci tensor at p. Thus, if we 

take the (Euclidean) Hessian of the function 6 +\ (-^)es 

we get 



(3.16) 



6 



LGC 



^ 5ik - Rikijp) - \ Vj Rikip) x> + 0(r2), 



and by tracing 



(3.17) 



A 



Euc 



expl{dlls) \ ^ 1 ^^^^^^ ^2 

d^Eucl. / 3 



LGC 



{R),,-R{p), 



where Aeuc denotes the standard Euclidean Laplacian. Hence, in local 
geodesic coordinates we can write 



(3.18) 



LGC 



6 



= 6 - (^Rikip) - I {R{s))j:, Sikj x'x'' - \ Vj Rikip) x'x^x^ + 0{r^). 
Note that from this latter asymptotics we can formally compute 



(3.19) {Hess^s)ik{.x^) = -Rik{p) + ^ ^ik + 0{r), 

-L/CrG O 

which shows that, around any given point p, the convexity properties of $5 
are related to the sign of the Ricci curvature. In particular, we have the 

Lemma 3.2. Let be a smooth solution of (3. 4) along a Ricci flow with 
uniformly bounded curvature operator on S x [0,/3*], then 



(3.20) 

in the barrier sense. 



Hess > -Ric{gis)) + - (i?(s))s, g{s) 
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Proof. Let Bg{p,r) C {Tis,g{s)) be a geodesic ball of radius r centered at 
the generic point p G E^. For any £ > 0, let us define 

(3.21) $(^)(u,d(p,x)) = 



1 -\ 

a 



Away from the cut locus, the function ^^^^(■u, c?(p, x)) is smooth and such 
that $i'^^(tt, = $s(p). Prom the asymptotic expression of 

in Bg (j>, r) we compute 

(3.22) ^six'') -^^sHu,r) > 

Assume that Rics{u, u)\p ^ 0, (otherwise move to the next non- vanishing 
higher order term in the asymptotics). Along a smooth Ricci flow with uni- 
formly bounded curvature operator on S x [0,P*], the derivatives IV^'') Ricig)\, 
k > 1, are bounded ([21], Th. 13.1). Thus, we can assume that the O(r^) 
terms in (3.22) are uniform in r, and we can define 



2\ /o\2 \2 



1 



(3.23) 0< r(e) = 2e ,^ „. , , , 

^ \VuRiCsiu,u)\p\ 

to the effect that 

(3.24) <^>,{x'')>^i'\u,r) 
for < r < r(e). Finally, 

(3.25) Hess^'f \u,d{p,x))\p = -Rics{u,u)\p + ^ {R{s))j.^ gs{u,u)- 

-8 (^{R^)^^-{R)iy gs{u,u). 

Thus, for r sufficiently small, ^i^\u,d{p,x)) is a lower barrier function for 
^s{x). Since the base point p G is arbitrary, it follows that Hess <I>s(x) > 
—Ric{s) + I {R{s))y^^ g{s) in the barrier sense. | 

Note that if the Ricci curvature of {T,g,g(f3)) is bounded below, i.e., if there 
is a G M such that Rik{P)u''u^ > Kfj giku'^u^, V ii : — > TE^, then the 
above lemma implies that 

(3.26) 2R,k - ^ (i?(/3))s/3 9ik + {Hess^p)ik > gik 

a relation that will be useful in discussing the approach to equilibrium for 
the Fokker-Planck dynamics associated with the Ricci flow. 
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3.2 Ricci flow evolution of probability measures 



There is a useful consequence of the above parametrization of the curvature 
fluctuations which immediately shows why Fokker-Planck diffusion is natural 
when we consider the evolution of a probability measure along the fiducial 



Lemma 3.3. For any curve of probability measures (0,/?^) 3 t i-^ (Kit € 
ProhiTi), absolutely continuous with respect to dlit, the following identity 
holds along the the backward volume-normalized Ricci flow 

'^-S [dnt II cfflt] = -/ [dUt II cfflt] + 



(3.27) 



dt 



where 
(3.28) 

and 
(3.29) 



dQt 



^^cmt - Ag(t) dnt + V' {dUtVi^t) 



S [dnt II dUt] = [ dnt In 



dnt 

dlTt' 



I [dnt II dUt] = [ dntV 



; . dnt dnt 



respectively denote the relative entropy of dnt with respect to dUt and the 
associated entropy generating functional. Moreover, one computes 

(3.30) [dnt II dUt] = 



dnt 



V In 



dnt 

dflt 



+ 2VMn— 
dUt 



d , dnt 

— In 

dt dUt 



1 dnt 



Vln 



dnt 



V^^tViln 



dnt 
dm 



+ 



+ / dnt 

'St 



-2R^\t) + -{R{t))^J\t) 



V.ln^V.ln^ 
' dUt dHt 



dnt 



R'^{t) + {Hess $t) 



ik 



dnt dnt 



-2 / d^t 



Hess I In 



dnt 
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Proof. Since by hypothesis d^t is a probabiUty measure absolutely contin- 
uous with respect to dUt, its total mass is preserved along the (backward) 
volume-normalzed Ricci flow. By factorizing dflt in terms of dUt and by 
exploiting Otto's parametrization (3.4), we get 



(3.31) 



d_ 

dt 



d fd^t 



dt \ dlif 



dRt Vi$tV^ 



dVtt 



dt 



d^lt In 



dnt 

d^t 



In 



fdnA d 



\dntj dt 

dn 

dllt 



{dQt 



Integration by parts provides the identity 
(3.32) / ln^[A,(,)dJ^t-VMc/OtVi$t)] 



'St 

- / dUt vnn ^ V, In ^ + / dnt V,$t In ( 



Kdlit 



which we rearrange as 

(3.33) £ d^t Vi$tV' In = I [dQt \\ dUt] + 



dn. 



In— ^[A,(,) dnt-v' (dntVi^t)]. 

By inserting (3.33) in (3.31) we get(3.27). The proof of (3.30) is a lengthy 
routine computation which can be performed along the following steps: 

(3.34) [dnt II dUt] = ^l' dnt g^\t)Vi In ^ V, In ^ = 

dt dt 7sf dllt dllt 



_ d 
~ di 

dn. 



dUt 



St 

dnt 



St 



St 



V In 



dnt 
dIL 



St 

dnt ^ 

dUt 

2 



dn. 



dn. 



dUt dUt 



-2R"(t) + -{R(t))^g»{t) 



„ , dnt „ , dnt 



+2 / dn,9-'(t)V,ln;5£v4lnj^. 

Jst dUt dt dUt 
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The structure of (3.27) suggests to add and subtract, to the ^ In ^ terms 
in the above expression, the quantity 

2 



(3.35) 



V In 



dQt 



- V'$tViln 



dQt 



which is the rewriting in terms of In of the generator of the Fokker-Planck 
operator 

(3.36) ^dOt - Ag^t^dQt + (dQtVi^t) , 

appearing in (3.27). Applying the Bochner-Weitzenbock formula 



(3.37) 



—2 Hess In 
and the identities 
(3.38) 



, dQt „ . , dil^t 
2V' In — -ViA„u-) In — - = A 

dUt dUt 

dVLt ^ 



■9(t) 



Vln 



d^t 



-2i?^^(t)V,In^V,ln.^^* 



dUt 





V In— r 


/ d^t 


dlit 



+ / d^tAgijt) 

JT,t 



V In 



d^t 

dm 



+ 2 / dJ^tVMn— ^Vi 



V In 

V In 



(3.39) 



dn 



VMn— ^VJ V'^^tVfcln 



dn 



dn 



dnt 

dfi; 

dnt 



1 



V In 



dnt 
dn; 

dOt 

dn; 

dl7t 2 



+ 



dn. 



(the former obtained by iterated integrations by parts and the second by 
direct computation [35]), one eventually gets the stated result. | 



3.3 Fokker-Planck diffusion along the Ricci flow 

Prom Lemma 3.3, we immediately get the following 

Theorem 3.4. The Fokker-Planck diffusion {dnt)t>o generated, along the 
backward, volume-preserving, Ricci flow, by 

(3.40) - {dnt) = Ag^t)dnt - V' (dntVi^t) , 

has the following properties: 
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(i) It is a gradient-like flow for the relative entropy functional S [d^t II dllt\j 



I.e., 



d f d^t r /dnA , /dnA 

(a) The corresponding evolution for the Radon-Nikodym derivative 
(o 49^ d dnt _ dnt i dQt 

is the gradient flow, (in the L^(S,dnt) sense), of S [dflt || dUt] with respect 
to the inner product (...,.. ■)dnt\dUt' defined by (1-4:). 

(Hi) The associated entropy production functional / [dilj || dUt] satisfies the 
differential inequality 

(3.43) ^/ [dQt II dUt] <-2KtI [d^t \\ dlit] 

where Kf G M is the (t-dependent) lower bound of the Ricci curvature. 

(iv) If along the given Ricci flow (3 i-^ OabiP) the diameter stays uniformly 
bounded, i.e., sup^>o {diam (S, g{P))} = diam < oo, then 



(3.44) S[dnt II dUt] > \ 



Df{d^t,dRt) 



diam 



and the quadratic Wasserstein distance (d^t^dHt) is weakly monotoni- 
cally decreasing along the backward Ricci flow. 

Proof, (i) The first part of the theorem, and in particular the gradient-hke 
nature of {dflt)t>o, is a direct computational consequence of Lemma 3.3. 

(a) It is easily verified that the absolutely continuous curve t i— > dQt solution 
of (3.40) induces the evolution (3.42) on the associated Radon-Nikodym 
derivatives (^^)t>o- The proof that this is a gradient flow with respect to 
the inner product (...,.. ■)dnt\dUt follows from the analysis in section 1.1, 
(see eq. (1.28)). In particular, if we denote by £ C6(S,M)/M the tangent 
vector to {dQt)t>o, i-e., the solution, for each given t = s, of the elliptic 
PDE 



(3.45) ^^dnt 



= -g'Hs)Vi{dntVk^s) , 

t=s 

then, according to (1.17) and (1.18), the tangent vector to the curve of 
Radon-Nikodym derivatives i^^)t>o, at the given value of t, is provided by 
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(^'f — $t). The gradient flow condition with respect to the inner product 
(1.4) takes the form (1.26), i.e. 

(3.46) ((*t - $t), 0«k + {GradS[dnt \\ dllt], O^lk = > 



G Tdnt Prob{T,t), which is equivalent, in the L^{T,,dUt) sense, to (3.42). 
Note that by comparing (3.45) with (3.40), one gets the relation Vj (*( — $*) 
= — Vj In (^^^ • Thus, we can equivalently rewrite the entropy production 
functional as the Otto norm of the vector {"^t — $t), i.e., 



(3.47) I [d^t II dUt] = / |V (*t - $t)p 



dQt 



(iii) Prom Lemma 3.3 we have 



(3.48) 



dQf 



St 

-2 



d_ 

dt 
2 



/ II dUt] 



-2R^'{t) + -{R{t))^^g^\t) 



ii:^*^(t) + (i/ess #t) 



-2 / drit Hess fln^ 



which, according to Lemma 3.2 and (3.26), directly yields (3.43). 

(iv) As long as the diameter diam (T,, g{P)) of {T,,g{P)) remains uniformly 
bounded along the given Ricci flow, we have the Talagrand-like inequality 
(1.39) from which (3.44) immediately follows. Note that for a Ricci flow with 
uniformly bounded Ricci curvature \Ric{l3)\ < M we have the elementary 
bound for diam (T,, g{P)), (see [41] and also [21]), 

(3.49) diam (S, 5(0)) e"^^ ^ < diam (S, g{(3)) < diam (S, 5(0)) e^^^ ^ , 

(the factor 2 is due to the volume normalization of the flow). Recently, P. 
Topping [41] has obtained an improved control on diam (T,, g{(5)) in terms 
of suitable averages of the scalar curvature. | 



In its simplest form, the rate of convergence of a solution of 
f d dflt _ A . . d(it yyiift \7 dClt 

mm; - ^9{t)drh ~ ^ **^»dn7' 



(3.50) 



dUt I 



dQp 
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to the stationary state ctQ^ is governed by a curvature condition which is 
naturally suggested by the structure of equation (3.48), and which, according 
to lemma 3.2, is equivalent to the positivity of the Ricci tensor. Note that, 
according to the characterization (3.4) of the potential also {dIlt)t>o 
solves the Fokker-Planck equation (3.40), (with the initial datum dHo), 
along the backward Ricci flow. If (dnt)t>o is a solution of (3.40) then we 
call (drit, dIlt)t>o a conjugated Fokker-Planck pair along the backward Ricci 
flow. Taking into account this elementary observation, we get the following 
result 

Lemma 3.5. Let (3 Qabifi), P G [0,00), a given Ricci flow metric start- 
ing on a manifold {T,,g) of positive Ricci curvature, and let {drit, dIlt)t>Q 
be the conjugated Fokker-Planck pair, solution of the Fokker-Planck equa- 
tion (3 AO). Then, along the backward Ricci flow, the entropy functional 
S [dQf II dUt] decreases exponentially fast according to 

(3.51) S [d^t II dUt] < S [dno \\ dUo] e^i^-f *, 

where ^Xini = iiif/3>o {K/3 > : Ric{(3) > Kpg{(3)}. Moreover, the Tala- 
grand inequality 

(3.52) s[dnt II dUt] > ^ [L>f (da,dnt)]' , 

holds, and S [dilt || dllt] is a convex function along the backward Ricci flow. 



Proof. Since the positivity condition on the Ricci tensor is preserved by the 
Ricci flow and yields long time existence [19], we have Ric{[3) > Kpg{(3), 
Kf} > c > along the flow. According to (3.26), such positivity implies the 
condition 

(3.53) 2Ric{(3) - ^ {Rm^^ g{P) + Hess > ^ <?(/?), 
on the curvature terms entering (3.48). It follows that 

(3.54) j^I [dnt II cfflt] < -^Ainf / [dOt II cMt] , 

which implies that the entropy dissipation functional / [d^t 1 1 dUt] decreases 
exponentially fast according to 

(3.55) / [d^s II dUs] < I [d^t \\ dUt] e^i^-f 

for any {s — t) > 0. Let teq (this may be finite or infinite) the value of t for 
which dQt attains equilibrium (i.e. dQt = dUt for t > teg). If we integrate 

(3.55) over s from t to tgq, 

rteq 3/2 \ 

(3.56) / / [d^s II dUs] ds < —— { 1 - e^s^i"* (^-i-^) ] I [dUt \\ dn*] . 

Jt 2 Ainf ^ ^ 
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and take into account that / [dQs II dUg] = —-^S [dOg || dUg], together with 
S [dQt^^ II cffltej = 0, and - e-f^-f i^e^-t)^ < get 

(3.57) s [dnt II dUt] < -|- / [dnt II dUt] . 

^ Ainf 

Since the time t is arbitrary, this estabhshcs that a logarithmic Sobolev in- 
equahty, of constant Ajnf, holds for the diffusion process {dnf}tyQ. By insert- 
ing (3.57) in (3.41) we immediatey get the exponential trend to equilibrium 
(3.51). Since {dnt, dIlt)t>o are a conjugated Fokker-Planck pair, according 
to [35], the validity of the logarithmic Sobolev inequality on {T,,g{t)) im- 
plies the Talagrand inequality and hence (3.52) follows. The convexity of 
S [dnt II dUt] is a direct consequence of (3.54) and (3.57) above. Explicitly, 
from (3.54) and / [dn^ \\ dU^] = -£S [dQ^ || dUg], we get 

(3.58) [dnt II dUt] > (^^ Xinf^ s [dnt \\ dUt] , 

which by Talagrand inequality (3.52) yields 

(3.59) ^s[dnt\\dnt]>^(^^Xinf^ [D^{dnt,dUt)]\ 

and S [dnt \\ dUt] is t-displacement convex along the Ricci flow. | 



Note that if we introduce the adimensional variable r) = ^Xinft then (3.59) 
can be equivalently rewritten as 

(3.60) [dnt II dUt] > ^Xinf [D^idnt, dno] ' , 

which is equivalent to Sturm's i^-convexity [46] of S [dnt || dUt], (for K = 
^Xinf), (see (1.38)). The point here is that, typically, the JT-convexity of 
a relative entropy functional holds along the Wasscrstcin geodesies of the 
mctrizcd probability space (Pro6(S,(?), D^( , )). In particular, it was 
conjectured to hold for Riemannian manifolds with non-negative Ricci cur- 
vature by F. Otto and C. Villani [35], (in the case K = 0, the conjecture 
has been proven in [13] whereas in the general case ( for any ii' € M) in 
[46]). These remarks suggest that the Fokker-Planck diffusion {dnt}t>o is 
strictly connected with Wasserstein geodesies in the bundle ProhiT,). To 
discuss to what extent this is the case, let us recall that, at any given 
the tangent vector to the curve of Radon-Nikodym derivatives t ^ is 

provided by — In (see (3.47)). In analogy with the characterization 

of the parameter r characterizing Pcrelman's diffusion {dTu{t))t>o, let /ig 
denote the (squared) length scale over which the probability measure dnt=Q 
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3 

is concentrated. Roughly speaking {Atthq)^ is the typical volume of a do- 
main B C whose (ii^o-measurc is not exponentially small and yields for full 
measure if slightly blown up. Let us consider the adimensional parameter 



(3.61) £ = ^(47r/Xo) 



3 



and let us e-rescale the vector — In {^^^ according to 



(3.62) U£(£i) = -2£ln 



Prom equation (3.42) it easily follows that {u^(ety)f>Q evolves according to 
the viscous Hamilton-Jacobi equation [44, 45] 

(3.63) ^ + ^ = e (A^(,)u, - V^$tV,n,) , 

with (a smooth) initial datum Uc{x,t = 0) = Ue{x), x G S. When the 
parameter e defined by (3.62) is small, one may discuss the solution of (3.63) 
by the so-called vanishing viscosity method. Qualitatively, this implies that 
when £ — > 0, the rescaled vector approaches the Hopf-Lax solution 

Ueiy) + ^dt{x,yf ,t>0, xGS, 



(3.64) inf , „ 

^ ' 2/es '^'^^ 2t 

of the Hamilton-Jacobi equation 



(3.65) 

[ Ue{x,t = 0) = Ue{x). 

This is admittedly rather vague since, in our setting, the distance dt{x,y) 
varies with t, along the backward Ricci flow, and the viscosity solutions 
must take this dependence into account. However, for e << 1, i.e. if the 
probability measure dilf is initially concentrated on a set which is small 
with respect to the radius of curvature of (T,,g{t = 0)), the Fokker-Planck 
diffusion {dQ,t)t>o, behaves, for t sufficiently small, as if occurring in the 
fixed probability space Prob{'E,g{t = 0)), with a shadow of the Ricci flow 
still present through the forcing potential ^t=o- In such a case, the viscosity 
interpretation of (3.64) is more justified and, according to theorem 1.1, (see 
(1.43)), one can reach the conclusion that {dQ.t)t>o, for t small enough, 
approximates a geodesic in the Wasserstein space {Prob{Yl, g{0)), D^i , )). 
From a more geometrical point of view, one is here approximating the curve 



tt-^^ with the push-forward of ^ 



under the action of the semigroup 

=0 

defined by the infinitesimal generator — i|V?7ep, (where the norm is taken 
with respect to (S,y(i = 0))). It is a known fact that, whenever one moves 
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a measure density through the push-forward action of the exponential of a 
smooth function, one gets a geodesic in the appropriate Wasserstein space, 
(again, I wish to thank A. Figalh for useful remarks in this connection). 

What is missing in such an approximation argument is the explicit role 
of the backward Ricci flow. In this connection, the basic step to take is 
the characterization of Wasserstein geodesies not just in a fixed probability 
space (Pro6(S,5'), DY{ , )), but rather in the bundle Pro6(S) over the 
space of Riemannian metrics 7^iem(S). Roughly speaking, we expect that 
the Hamilton-Jacobi condition (1.43) of theorem 1.1 is an approximation to 
a more general geodesic equation in Pro6(S), more or less like straight lines 
approximate geodesies in Riemannian geometry. Correspondingly, the Hopf- 
Lax representation will be an approximation to the exponential mapping in 
ProhiTi). These remarks are strongly supported by the fact that, as we have 
seen above, the natural diffusion process along the backward Ricci flow is 
the viscous Hamilton-Jacobi equation (3.63), where the viscosity parameter 
£ is naturaly characterized by the lower bound of the Ricci curvature. Here, 
we see a recurring theme in the Fokker-Planck dynamics of the conjugated 
pair {dVlt, dIlt)t>Q along the (backward) Ricci flow: (i) The Ricci curva- 
ture controls the geodesic convexity for the corresponding relative entropy; 
(ii) It parametrizes the viscosity solutions of the Hamilton-Jacobi equa- 
tion associated with the Fokker-Planck diffusion; (in) It naturally affects 
the logarithmic Sobolev inequalities controlling the Wasserstein distance be- 
tween (dOj, dll.t)t>0- Points of contact between diffusion, geodesic convex- 
ity, Hamilton-Jacobi theory, and LSI are well-known (see e.g., [44, 45] for 
a discussion and relevant references), however here their relation seem to 
come to full circle. In our opinion, this is a serious indication of the exis- 
tence of deeper connections between the geometry of optimal transport and 
Ricci flow theory. In particular such connections point to the possibility of 
adopting the geometry optimal trasportation for extending the Ricci flow to 
metric spaces more general that Riemannian manifolds. 
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It is worthwhile to compare the Fokker-Planck diffusion (3.40) with Perel- 
man's flow (2.7). If we apply lemma 3.3 to {dw{t))t>o we immediately get 
the following 

Lemma 4.1. For a given Ricci flow metric (5 i — > giP), P & i^^T), and for 
any chosen /?* G [0,T), let t i — ^ dzu{t), t = (3* — (3, he a solution of the 
parabolic PDE Then, the relative entropy functional 



(4.1) S[dw{t)\\dllt\= f dw{t)\u 



dxu(t) 

varies along the fiducial flow 1 1 — ^ dllt according to 



(4.2) ^ [ dwit) In ^ = -/ [dw{t) II dllt] 

at jY,f di-h 



/st dllt 



Note in particular that S [dw{t) \\ dUt], as compared to S [dVlt \\ dUt], is 
not weakly monotonic. The term responsible for such a lack of monotonicity, 
(and of the fact that (2.33) is not the gradient flow of S[dvj{t) \\ dUt]), is 

the scalar product (^t)^^^r^/ • If we replace dw{t) with dQ.t, this is 

basically the drift term driving Fokker-Planck difi^usion in (3.40). Such a 
term describes also how the measure dw{t) localizes the fluctuations in the 
scalar curvature along the Ricci flow. 

Lemma 4.2. For a given Ricci flow metric (3 i — > g{(3), (3 € [0, T), and 
for any chosen [3* G [0,T), let t i — > dzu{t), t = (3* — (3, he a solution 
of the parabolic equation (2.33^ corresponding to the initial datum dwo = 
{dwo/dIlfj*)dIli3* . Then 



(4.3) 

and 
(4.4) 



dt 



dMt)_\ 



dwit) \ 



dw{t) {R{t) - {R{t))) , 



2 J dw{t) ^ Ric{t) ^ - ( mcit) 



I jjw{t)R{t) [R{t) - (i?(t))Ej - \[{Ritf)i:. - {R{t))U. 
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Proof. Relation (4.3) follows from (3.4) by a straightforward integration by 
parts 



(4.5) 



= y dw{t) {R{t) - {R{t))) . 



i [{RWfh, - {Rm%) 



The evolution equation for scalar curvature (2.47) provides 
(4.6) -^{Rm^^ = 2{\Ric{P) 
From these latter relation, (2.47) and (4.3) one directly computes 
(4.7) 



d / dMfy\ 
dt \ *' cfflt / 



dlit 



y R{t)/^{dw{t)) + y dw{t)^R{t) - I {R{t)) 



/s 
= -2 



dw{t) \Ric{t) - { Ric{t) )st - 



y dw{t)R(t) [R{t) - (i?(t))sj + 



which provides the stated result (4.4). 



Note in particular that if we choose for (2.33) the initial datum dw{t = 0) = 
dUp* we get, 



t=0 



which again shows the role that mean square fluctuations in scalar curvature 
have in controlling the concentration mechanism of the measure dw{t). We 
can trace here the difference between the standard Perelman flow charac- 
terizing the backward diffusion of dw{t) and the Fokker-Planck diffusion of 
dVlf. The former feels curvature fluctuations in a more indirect way as a 
forcing effect deforming the trajectory of dw{t) in Prob{T,). Such a forcing 
behavior is made manifest by the fact that the evolution of dw{t) is not the 
gradient flow of the associated relative entropy, and, according to lemma 4.1, 
the failure of being gradient is exactly provided by the term ^WT"") ^-^ ■ 
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Conversely, the diffusion {dClt)t>o has the curvature fluctuations taken care 



which renormalizes Perelman's {dw{t)}t>o into the Fokker-Planck diffusion 
{dnt}t>o- 
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